morteza haghir chehreghani
Cold-Start Active Correlation Clustering
Aronsson, Linus, Wu, Han, Chehreghani, Morteza Haghir
We study active correlation clustering where pairwise similarities are not provided upfront and must be queried in a cost-efficient manner through active learning. Specifically, we focus on the cold-start scenario, where no true initial pairwise similarities are available for active learning. To address this challenge, we propose a coverage-aware method that encourages diversity early in the process. We demonstrate the effectiveness of our approach through several synthetic and real-world experiments.
A Deep Learning Framework for Generation and Analysis of Driving Scenario Trajectories
Demetriou, Andreas, Alfsvåg, Henrik, Rahrovani, Sadegh, Chehreghani, Morteza Haghir
We propose a unified deep learning framework for the generation and analysis of driving scenario trajectories, and validate its effectiveness in a principled way. To model and generate scenarios of trajectories with different lengths, we develop two approaches. First, we adapt the Recurrent Conditional Generative Adversarial Networks (RC-GAN) by conditioning on the length of the trajectories. This provides us the flexibility to generate variable-length driving trajectories, a desirable feature for scenario test case generation in the verification of autonomous driving. Second, we develop an architecture based on Recurrent Autoencoder with GANs to obviate the variable length issue, wherein we train a GAN to learn/generate the latent representations of original trajectories. In this approach, we train an integrated feed-forward neural network to estimate the length of the trajectories to be able to bring them back from the latent space representation. In addition to trajectory generation, we employ the trained autoencoder as a feature extractor, for the purpose of clustering and anomaly detection, to obtain further insights into the collected scenario dataset. We experimentally investigate the performance of the proposed framework on real-world scenario trajectories obtained from in-field data collection.
- Transportation (0.35)
- Automobiles & Trucks (0.35)
Prediction of Time and Distance of Trips Using Explainable Attention-based LSTMs
Balouji, Ebrahim, Sjöblom, Jonas, Murgovski, Nikolce, Chehreghani, Morteza Haghir
In this paper, we propose machine learning solutions to predict the time of future trips and the possible distance the vehicle will travel. For this prediction task, we develop and investigate four methods. In the first method, we use long short-term memory (LSTM)-based structures specifically designed to handle multi-dimensional historical data of trip time and distances simultaneously. Using it, we predict the future trip time and forecast the distance a vehicle will travel by concatenating the outputs of LSTM networks through fully connected layers. The second method uses attention-based LSTM networks (At-LSTM) to perform the same tasks. The third method utilizes two LSTM networks in parallel, one for forecasting the time of the trip and the other for predicting the distance. The output of each LSTM is then concatenated through fully connected layers. Finally, the last model is based on two parallel At-LSTMs, where similarly, each At-LSTM predicts time and distance separately through fully connected layers. Among the proposed methods, the most advanced one, i.e., parallel At-LSTM, predicts the next trip's distance and time with 3.99% error margin where it is 23.89% better than LSTM, the first method. We also propose TimeSHAP as an explainability method for understanding how the networks perform learning and model the sequence of information.
- Transportation > Ground > Road (1.00)
- Energy > Power Industry (1.00)
- Automobiles & Trucks (0.96)
- Transportation > Electric Vehicle (0.70)
Shift of Pairwise Similarities for Data Clustering
Several clustering methods (e.g., Normalized Cut and Ratio Cut) divide the Min Cut cost function by a cluster-dependent factor (e.g., the size or the degree of the clusters), in order to yield a more balanced partitioning. We, instead, investigate adding such regularizations to the original cost function. We first consider the case where the regularization term is the sum of the squared size of the clusters, and then generalize it to adaptive regularization of the pairwise similarities. This leads to shifting (adaptively) the pairwise similarities which might make some of them negative. We then study the connection of this method to Correlation Clustering and then propose an efficient local search optimization algorithm with fast theoretical convergence rate to solve the new clustering problem. In the following, we investigate the shift of pairwise similarities on some common clustering methods, and finally, we demonstrate the superior performance of the method by extensive experiments on different datasets.
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- North America > United States > Kansas (0.04)
- Europe > Switzerland > Zürich > Zürich (0.04)
- (3 more...)
Memory-Efficient Sampling for Minimax Distance Measures
Hoseini, Fazeleh Sadat, Chehreghani, Morteza Haghir
Learning a proper representation is usually the first step in every machine learning and data analytic tasks. Some recent representation learning methods have been developed in the context of deep learning [1], which are highly parameterized and require a huge amount of labeled data for training. On the other hand, there are methods that learn a proper representation in an unsupervised way and usually do not require learning free parameters. A category of unsupervised representations and distance measures, called link-based distance [2, 3], take into account all the paths between the objects represented in a graph. These distance measures are often obtained by inverting the Laplacian of the base distance matrix in the context of Markov diffusion kernel [2].
Hierarchical Correlation Clustering and Tree Preserving Embedding
We propose a hierarchical correlation clustering method that extends the well-known correlation clustering to produce hierarchical clusters. We then investigate embedding the respective hierarchy to be used for (tree preserving) embedding and feature extraction. We study the connection of such an embedding to single linkage embedding and minimax distances, and in particular study minimax distances for correlation clustering. Finally, we demonstrate the performance of our methods on several UCI and 20 newsgroup datasets.
- Asia > Afghanistan > Parwan Province > Charikar (0.04)
- North America > United States > Kansas (0.04)
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.04)
- (2 more...)